Abstractive Multi-document Summarization by Partial Tree Extraction, Recombination and Linearization
نویسندگان
چکیده
Existing work for abstractive multidocument summarization utilise existing phrase structures directly extracted from input documents to generate summary sentences. These methods can suffer from lack of consistence and coherence in merging phrases. We introduce a novel approach for abstractive multidocument summarization through partial dependency tree extraction, recombination and linearization. The method entrusts the summarizer to generate its own topically coherent sequential structures from scratch for effective communication. Results on TAC 2011, DUC2004 and 2005 show that our system gives competitive results compared with state-of-the-art abstractive summarization approaches in the literature. We also achieve competitive results in linguistic quality assessed by human evaluators.
منابع مشابه
Partial-Tree Linearization: Generalized Word Ordering for Text Synthesis
We present partial-tree linearization, a generalized word ordering (i.e. ordering a set of input words into a grammatical and fluent sentence) task for text-to-text applications. Recent studies of word ordering can be categorized into either abstract word ordering (no input syntax except for POS) or tree linearization (input words are associated with a full unordered syntax tree). Partial-tree ...
متن کاملAbstractive News Summarization based on Event Semantic Link Network
This paper studies the abstractive multi-document summarization for event-oriented news texts through event information extraction and abstract representation. Fine-grained event mentions and semantic relations between them are extracted to build a unified and connected event semantic link network, an abstract representation of source texts. A network reduction algorithm is proposed to summariz...
متن کاملAbstractive Multi-document Summarization with Semantic Information Extraction
This paper proposes a novel approach to generate abstractive summary for multiple documents by extracting semantic information from texts. The concept of Basic Semantic Unit (BSU) is defined to describe the semantics of an event or action. A semantic link network on BSUs is constructed to capture the semantic information of texts. Summary structure is planned with sentences generated based on t...
متن کاملMulti-Document Abstractive Summarization Using ILP Based Multi-Sentence Compression
Abstractive summarization is an ideal form of summarization since it can synthesize information from multiple documents to create concise informative summaries. In this work, we aim at developing an abstractive summarizer. First, our proposed approach identifies the most important document in the multi-document set. The sentences in the most important document are aligned to sentences in other ...
متن کاملOpen-Domain Multi-Document Summarization via Information Extraction: Challenges and Prospects
Information Extraction (IE) and Summarization share the same goal of extracting and presenting the relevant information of a document. While IE was a primary element of early abstractive summarization systems, it's been left out in more recent extractive systems. However, extracting facts, recognizing entities and events should provide useful information to those systems and help resolve semant...
متن کامل